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REDUCING POWER CONSUMPTION OF AN clock signal includes disabling the clock signal and reducing 

ELECTRONIC DEVICE toe leakage power consumption of the device. 

BACKGROUND DESCRIPTION OF THE DRAWING 

This invention relates generally to reducing the amount of 5 FIG. 1 is a block diagram showing a processor core in 

power consumed in a reduced dynamic power consumption accordance with one embodiment of the present invention; 

state by a variety of electronic devices including those that FIG. 2 is a block diagram of a computer system including 

use battery power sources, such as portable computers. the processor illustrated in FIG. 1; 

In devices such as computers that may be operated from 10 FIG. 3 shows a block diagram of a power management 

a battery, it is important to reduce the power consumption to control circuit in accordance with one embodiment of the 

the greatest possible extent. The usefulness of battery oper- present invention; 

a ted devices is reduced if the battery must be recharged FIG. 4 is a chart showing the different prior art power 

frequently. A variety of techniques are known for reducing consumption states; 

the dynamic power consumption. For example the Advanced 15 piG. 5 shows a flow for reducing leakage power con- 
Configuration and Power Interface (ACPI) Specification, sumption; and 

(Rev. 1.0, Dec. 22, 1996) sets forth information about how pj G 6 sbaws a methodology for transitioning from a 

to reduce the dynamic power consumption of portable and reduced leakage power consumption state, 
other computer systems. 

With respect to the microprocessors used in computer 20 DETAILED DESCRIPTION 

systems, four processor power consumption states (C0-C3) The power consumption of an electronic device may be 

are defined in the ACPI Specification. When the processor is ma£ je up 0 f ^ components. The dynamic power consump- 

executing instructions it is in a CO state. There are three ^on rclatcs t0 ^ t power that is consumed when the device 

non-executing states (C1-C3). In a working computer ^ operating. In connection with processors, the dynamic 

system, the operating system dynamically transitions idle 25 power consumption occurs when the processor's clocks are 

processors into the appropriate power consumption state. operating. The leakage power consumption may occur when 

State CI is the processor power state with the lowest the device is not operating and power continues to be 

latency. Basically, aside from putting the processor in a consumed based on the leakage current which flows through 

non-executing power state, the CI state has no other soft- the transistors, in the off state, that make up the electronic 

ware visible effects. 30 device. 

The C2 power state offers improved power savings over Leakage power consumption may derive from weak 
the CI state. Like the CI state, aside from putting the inversion and the finite impedance between the source and 
processor in a non-executing power state, the state has no drain of complementary metal oxide semiconductor 
other software visible effects. In the C2 power state, the (CMOS) transistors when the transistors are in the off state, 
processor is still able to maintain the context of the system 35 This leakage current, which is sometimes also called sub- 
caches, threshold leakage, has an exponential dependency on thresh- 

The C3 power state offers still lower dynamic power old voltage divided by thermal energy (kT). 

consumption compared to the CI and C2 states. While in the The threshold voltage may be determined by process 

C3 state, the processor's caches are maintained but snoops conditions, temperature and channel length. As a rule of 

are ignored. The operating system software is responsible 40 thumb, the leakage current doubles for every 15° C. increase 

for ensuring that cache coherency is maintained. In the C3 in temperature. Moreover, as process technologies continue 

state, the processor may not be able to maintain coherency to scale, the threshold voltage and channel length become 

of the processor caches with respect to other system activi- smaller. As a result, the leakage current, and thus the leakage 

ties. The C3 power consumption state uses less power but power consumption, increases for each process generation, 

has a higher exit latency than the C2 power state. Due to the short channel effect with sub-micron channel 

Generally, the C3 state uses one of two mechanisms to lengths, leakage current may have a sub-linear dependency 

maintain cache coherency. The operating system may flush on supply voltage. In other words, the leakage current 

and invalidate the caches prior to entering the C3 state. The decreases sub-linearly as supply voltage decreases, 

flushing of the caches may be provided through predefined SQ Gate leakage due to a tunneling effect may also cause 

ACPI mechanisms. Alternatively hardware mechanisms leakage current. The tunneling current has an exponential 

may be provided to prevent bus masters from writing to dependency on supply voltage. 

memory. In processors that use hardware mechanisms, the i n portable computers with battery operation, the 

bus masters may be disabled prior to entering the C3 state. increased leakage power consumption may have a direct 

When a bus master requests an access, the processor awak- 55 impact on battery life. Leakage current may be a limiting 

ens from the C3 state and re-enables bus master access. factor for threshold voltage scaling in battery operated 

While the reduced power consumption states outlined by devices, 

the ACPI Specification and known techniques have many Referring now to FIG. 1, an architecture for one illustra- 

advantages, there are instances where greater power con- tive embodiment of a microprocessor 12 implementing an 

sumption reductions may be desired. Thus, there is a con- 60 embodiment of the present invention includes a clock gen- 

tinuing need for ways to further reduce the power consump- erator 30 and a bus interface unit (BIU) 14. The BIU 14 is 

tion of computer systems and other components including coupled to cache and prefetch buffers 16 which may include 

devices which are operated from batteries. a branch target buffer (BTB). The caches may store instruc- 

lions and data for execution by the processor 12. Prefetch 

SUMMARY 65 DU ^ ers mav De coupled to a cache to enable prefetching of 

In accordance with one embodiment, a method of reduc- data and instructions from the cache or from the BIU 14 for 

ing the power consumed by an electronic device using a execution by the processor 12. 



05/12/2004, EAST Version: 1.4.1 



US 6,347379 Bl 

3 4 

The cache and prefetch buffers 16 may be coupled to an 72. The system bridge includes interface circuits to different 

instruction and data path section 18. The instruction and data ports, including a universal serial bus (USB) port 76 and 

path section 18 may include an instruction decoder that mass storage ports connectable to mass storage devices such 

decodes the incoming instructions. A microcode unit may as a hard disk drive. 

contain a memory which stores the microcode instructions 5 Other components coupled to the secondary bus 86 

for the processor. The data path is the main execution path include an input/output (I/O) circuit 90 that may couple to a 

for the processor. It may contain an arithmetic logic unit parallel port, serial port, floppy drive, and infrared port. A 

register file, barret shifter, constant read only memory and non-volatile memory 82 for storing basic input/output sys- 

flags. The processor may also include a floating point unit tern (BIOS) routines may be located on the bus 86, as may 

(not shown). 10 a keyboard device 92 and an audio control device 94, as 

The clock unit 30 generates the clock signals for the examples, 

processor 12. The clock unit 30 may, for example, generate Referring to FIG. 3, an illustrative power management 

the clock signals in response to an external frequency clock control logic according to an embodiment for controlling the 

input. The clock unit 30 supplies the clock signals to the BIU core clock frequency and the supply voltage level to the 

14 and to the remaining units of the processor as well. J5 processor 12 may be separated into a first portion 100 and a 

The clock unit 30 also includes control logic 22 for second portion 102. However, the first control logic portion 

controlling the operation of the clock unit 50. The control 100 may be included in the host bridge 58 and the second 

logic 22 may include a Vcc detector 24 which monitors the control logic portion 102 may be included in the system 

supply voltage level and issues a reset signal if the voltage bridge 74. Alternatively, the first and second control logic 

level becomes too low. The clock generator 12 also includes 20 portions may be implemented as separate chips, 

the phase locked loop (PLL) 42 which communicates with The control logic 100, 102 provide control signals to the 

the BIU 14. voltage regulator 52 to adjust its voltage levels and to the 

The frequency of the internal clock may be controlled processor 12 to adjust the processor's internal clock fre- 

through signals generated by an external clock 50 such as the quency. In addition, the power management control logic 

bus clock signal BCLK in processors made by Intel Corpo- 25 100, 102 may transition the processor 12 into a reduced 

ration. That is, the internal clock may be sped up or slowed power consumption state. 

down based on the input clock signal from the clock 26. A brief description of the interface signals between the 

In addition a voltage regulator 52 selectively provides one power management control logic 100, 102 and the other 

of two supply voltage levels to the clock unit 30 and the rest components of the system follows. The signal LO/HI# 

of the processor 12. The voltage regulator 52 may receive provided by the control logic 100 to the processor 12 

two reference voltages. The higher reference voltage may be determines if a core clock frequency of the processor 12 is 

a conventional external supply voltage in accordance with at a high or low level. As an example, the core clock 

the particular technology being utilized. The lower reference frequency may vary between 350 MHz and 450 MHz 

voltage may be the low power supply voltage to reduce „ depending on whether LO/HI# is active or not. Additional 

leakage power consumption. oils may be used to adjust the core clock frequency to more 

Referring to FIG. 2, an example computer system 10 ^° levek - t t . 

includes a processor 12 (e.g., an 80x86 or Pentium® pro- A signal VR_LO/HI# is provided by the control logic 

cesser from Intel Corporation) that receives an external portion 100 to the voltage regulator 52 to adjust the voltage 

clock BCLK (from a clock generator 50) and a supply 40 level ^PP 11 " 1 °y lhe vo,ta g e regulator 52. Additional bits 

voltage (from a voltage regulator 52). The voltage regulator other than VR_LO/HI# may also be used to provide addi- 

52 and the clock generator 50 are both controllable to adjust tional voltage levels. 

the core voltage levels as well as the core clock frequencies A signal G_STPCLK# may be provided to the processor 

in the processor 12, as further described below. 12 and a signal G_CPU_STP# may be provided to the 

The main power supply voltages in the computer system 45 clock generator 50 to place the processor 12 in a reduced 

10 are provided by a power supply circuit 56 that is coupled dynamic power consumption state (e.g., deep sleep, stop 

to a battery 60 and an external power source port 50. When grant, C2, or C3 state) so that the clock frequency and supply 

the external power source (not shown) is plugged in or v °ltag e » cvel of lhe processor 12 may be varied, 

removed, an interrupt (e.g., system management interrupt or A signal VRCHGNG# is provided by the control logic 

SMI) may be generated to notify system software of the 50 100 10 system electronics circuitry 101 (e.g., the host bridge 

external power source insertion or removal. In addition, 58 and system bridge 74) to indicate that the voltage level 

docking the computer system 10 to a docking base unit may from the voltage regulator 52 is changing. A signal VRP- 

also indicate a power source transition. In one embodiment, WRGD from the control logic portion 100 to system elec- 

a device driver may detect power source transitions and tronic circuitry (e.g., the host bridge 58 and system bridge 

docking events by registering with the operating system for 55 74 ) indicates when the output from the voltage regulator 52 

power and plug-and-play notifications, for example. is within specification. 

The processor 12 may be coupled to a cache memory 54 According to one embodiment of the invention, when the 

as well as to a host bridge 58 that includes a memory voltage regulator on signal (VR_ON) is active (which is 

controller for controlling system memory 55. The host true whenever the system is on), the voltage regulator 52 

bridge 58 may also by coupled to a system bus 72, which 60 settles to the output selected by VR_LO/HI# (a low level or 

may in one embodiment be a peripheral component inter- a high level). When the outputs of the regulator 52 are on 

connect (PCI) bus, as denned in the PCI Local Bus and within specification, the voltage regulator 52 asserts a 

Specification, Production Version, Rev. 2.1, published on signal VGATE, which in turn controls the state of the signal 

Jun. 1, 1995. The system bus 72 may couple other VRPWRGD provided by the control logic portion 100 to 

components, including a video controller 64 coupled to a 65 system electronics circuitry. 

display 66 and peripheral slots 68. Asecondary or expansion Referring to FIG. 4, the conventional processor power 

bus may be coupled by system bridge 74 to the system bus management modes for an Intel architecture processor are 
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illustrated. The blocks 110-114 correspond to the ACPI for process variations and the gate overdrive voltage. The 

Specification processor power consumption states C1-C3 low supply voltage level may then be made up of the sum of 

respectively. In the Intel architecture processors, the CI the threshold voltage, the process variation adjustment, the 

mode is called the slop and grant/autohalt mode. In this state gate overdrive voltage and the guard band voltage for the 

most functional blocks of the processor are turned off by 5 powr supply. In one exemplary embodiment, the threshold 

disabling clocks. The phase locked loop (PLL) and the voltage may be 350 mV; the threshold voltage variation may 

global clock spine are left running. Some functional blocks ■» 5 ° m ^ lhe f te overdrive voltage may be 100 mV and the 

T~ . Jl trt Clt# ™ rt k, 1c 71: .„ H • „ guard band voltage may be 50 mV, resulting in a low supply 

are left running to support bus activity and snooping. * rf 550 mV 

In block 112, called "sleep" mode in desktops and "quick- [q ^ me low , volu ^ less uaQ lwice a 

start" in mobile computer systems using Intel architecture, io wol ^ c j Q 0 ne advantageous embodi- 

all the functional blocks and most of the input/output mcnl thc tow voltagp may ^ lcss than or equal to 

devices are turned off. The PLL, global clock spine and a about ^qq m v. In another advantageous embodiment, the 

few I/O devices are kept running. low supply voltage is about 200 mV above a transistor 

In block 114 the ACPI processor power consumption state threshold voltage. Also it may be advisable in some embodi- 
es (also called the deep sleep state in the Intel architecture) 15 ments to reduce the leakage power consumption to less than 
is illustrated. In this state, the PLL is off. All the functional or equal to 25% of what it would be at a conventional supply 
blocks stop running, but the registers and caches maintain voltage level for a given technology, 
their contents. In some processors, including Intel architecture 

Referring now to FIG. 5, the transition to a modified deep processors, when the internal processor voltage tails below 

sleep state corresponding to a modified state C3 of the ACPI 20 a certain level, the CPUPWRGD signal is deasserted. When 

Specification is illustrated. Initially a check at diamond 116 the CPUPWRGD signal is asserted again, the processor 

determines whether a transition to the C3 state is appropri- automatically starts a reset sequence. It is desirable to avoid 

ate. This may occur based on inactivity or other triggering the reset sequence because reset destroys the processor 

events as set forth for example in the ACPI Specification. register contents. 

Normally the transition to the C3 state would occur in a The power good (PWRGOOD) signal generated in Intel 

device which is already in the C2 state. When the transition architecture processors is the logical AND of the external 

occurs, the processor clock may be restarted, as indicated in power good and the internal power good signals. There is a 

block 120. This may be accomplished by asserting power good input pin on Intel architecture processors. The 

G_STPCLK#. Since the clock was stopped before entering 3Q power good signal is not latched and enables the PLL 

the C3 state, it is restarted to flush the caches. Next the circuitry. The internal power good signal, generated from 

caches, such as the L2, LI and BTB caches, are flushed as voltage detection circuitry 24, trips when the internal supply 

indicated in block 122. voltage (Vcc) is below a certain voltage level, which then 

The reason for flushing the caches is to reduce the soft shuts off the PLL. Since the internal supply voltage (Vcc) is 

error rate (SER). A soft error occurs as a result of particles, 35 raised before the PLL starts to lock to thc external clock, the 

such as alpha particles, which cause charges to be formed in power good signal will be normal before the PLL starts to 

semiconductor devices. With relatively low supply voltages, lock to the external clock. 

the rate of soft errors increases dramatically. Therefore a There are at least two ways to prevent glitching of the 

flush instruction is used to flush caches such as the L2 cache, power good signal during the modified deep sleep state. The 

the LI cache and the BTB. After the caches are flushed, as ^ internal power good may be gated by a signal which is the 

indicated in block 122, the clock may then be stopped and logical AND of a signal indicating whether the computer has 

the PLL is shut off, as indicated in block 124. This may be a modified deep sleep state and a signal indicating that the 

accomplished by deasserting G_STPCLK#. modified deep sleep state has been entered. 

Next the supply voltage transitions to a low supply Alternatively, the Vcc detector 24 may be disabled (for 

voltage as indicated in block 126. This may be done by 45 example by asserting the PLLIDDQ pin 22 in Intel® 

asserting the low level of VR_LO/HI#. Generally the sup- processors) during the modified deep sleep state. A disable 

ply voltage of the device is lowered to 100 mV to 150 mV signal (e.g., PLLIDDQ) may be used to disable the Vcc 

above the threshold voltage (including an adjustment for detector during a D.C. current test for shorts, called an 

process variation) at room temperature of the a -type metal IDDQ test This Vcc detector disable signal may also be 

oxide semiconductor (NMOS) or p-type metal oxide semi- 50 used to disable the Vcc detector to avoid the reset when the 

conductor (PMOS) transistors, whichever is larger. The voltage is deliberately lowered to reduce leakage power 

supply voltage transitions in response to a signal that imple- consumption. Once the Vcc detector is disabled, the reset is 

ments the transition to the C3 processor state. This transition not generated when the voltage level is decreased, 

signal causes the low reference signal to control the voltage Some I/O circuitry may need to use a separate power rail 

regulator 52. 55 io order to avoid triggering a false signal event when the 

The exact level of the low supply voltage depends on the supply voltage is lowered. I/O signals like data and address 
threshold voltage spread and the process control with a do not need to have a separate power rail because they are 
particular technology. A low supply voltage is used that is not sampled during the modified deep sleep state, 
sufficient to maintain all the internal registers of the device Through the techniques described herein leakage power 
so that they retain their contents while avoiding floating 60 consumption may be reduced through reduced leakage cur- 
nodes, rent and supply voltage, since leakage power is a function of 

Thus, an adjustment due to process variations may be leakage current times supply voltage. Due to the sub linear 

added to the threshold voltage. Agate overdrive voltage may dependency of leakage current on voltage, the leakage 

be added to that voltage to avoid floating nodes. Finally a power consumption has a square power dependency on 

guard band voltage for the power supply may be included, 65 voltage. Thus at sufficiently low supply voltages, the reduc- 

which in an exemplary embodiment may be 10% of the sum tion of supply voltages during the modified deep sleep state 

of the threshold voltage, the threshold voltage adjustment may significantly reduce power consumption. 
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Latency may be minimized compared to a suspend to the leakage power consumption at said second level is 25% 

RAM state. Exiting the modified deep sleep state may take or less of that at the first level 

about 1 to 2 milliseconds while the latency of exiting the 9. The method of claim 1 wherein said device includes a 
suspend to RAM state may be 10 to 20 seconds. processor and wherein reducing the leakage power con- 
In addition, the soft error rate is reduced because caches 5 sumption includes reducing the supply voltage to less than 
are flushed before entering the modified deep sleep state. twice the threshold voltage of a transistor in said processor. 
Since pipelines are drained before entering the modified 10. The method of claim 1 wherein the step of reducing 
deep sleep state, latches and domino circuits may be free of the leakage power consumption includes reducing the sup- 
soft errors. In at least some embodiments, the only storage ply voltage to 600 mV or less. 

elements that are still susceptible to soft errors in the 10 u. The method of claim 1 wherein said device is a 

modified deep sleep state are the registers. Thus the soft processor and wherein said processor has a supply voltage, 

error rate may be reduced. said method further including preventing the processor from 

Referring now to FIG. 6, the processor may exit the deep undergoing a reset sequence when the supply voltage is 

sleep state in response to a sleep transition signal, indicated increased. 

at diamond 154. In response to the transition signal, for 15 <y> A mc thod for implementing a reduced processor 

example triggered as set forth in the ACPI Specification, the consumption mode in a processor including system 

supply voltage may be increased using the higher voltage caches ^ including transistors with threshold voltages, 

reference as ^^^ r b ^^ fo ^ x ^^ b y said processor further including a processor clock and a 

ing the high level of VR _Lp/HI#. The PU. may be restarted t0O £ SDT m ly volt ™ ^ me thod comprising: 

and locked to the external clock as indicated in block 158 for 20 yvy ^ ' p R 

example by asserting G_STPCLK#. Once the PLL is locked flushing the system caches; 

to the external clock, the internal clock 30 may be started as stopping the processor clock; and 

indicated in block 160. reducing the processor supply voltage to less than or equal 

While the present invention has been described with ^ to approximately 600 mV. 
respect to a limited number of embodiments, those skilled in 13. The method of claim 12 wherein reducing the pro- 
the art will appreciate numerous modifications and varia- cessor supply voltage includes reducing said voltage to less 
tions. It is intended that the appended claims cover all such than or equal to approximately 600 mV. 
variations and modifications as fall within the true spirit and 14. The method of claim 12 wherein reducing the pro- 
scope of the present invention. ^ cessor supply voltage includes reducing the supply voltage 

What is claimed is: to about 200 mV above said transistor threshold voltage. 

1. A method for operating an electronic device, said 15. A method for operating an electronic device, said 
device using a clock signal, consuming power, having a device using a clock signal, consuming power, having a 
supply voltage and having a transistor having a threshold supply voltage and having a transistor having a threshold 
voltage, said method comprising: 35 voltage, said method comprising: 

disabling the clock signal; and disabling the clock signal; 

reducing the leakage power consumption of the device reducing the leakage power consumption of the device 

comprising reducing the supply voltage to about 200 comprising reducing the supply voltage to about 200 

m V over a transistor threshold voltage. mV over a transistor threshold voltage; and 

2. The method of claim 1 wherein reducing the leakage 40 increasing the supply voltage then restarting the processor 
power consumption includes reducing a voltage of at least phase locked loop and then restarting the processor 
one component to lower leakage power consumption of the clock. 

device. 16. A method for implementing a reduced processor 

3. The method of claim 1 wherein reducing the leakage power consumption mode and a processor including system 
power consumption includes flushing at least one cache. 45 caches and including transistors with threshold voltages, 

4. The method of claim 3 wherein said device includes a said processor further including a processor clock and a 
clock and wherein reducing the leakage power consumption processor supply voltage, said method comprising: 
includes restarting the clock before flushing a cache. flushing the system caches; 

5. The method of claim 4 wherein reducing the leakage . . 

power consumption includes disabling the clock signal after so sto ^ P*** 550 ' ckx *» , t 

flushing a cache. reducing the processor supply voltage to less than twice 

6. The method of claim 1 wherein reducing the leakage *e threshold voltage of one of the transistors; and 
power consumption includes reducing a processor supply increasing the supply voltage, restarting the processor 
voltage. phase locked loop and restarting the processor clock. 

7. The method of claim 1 wherein reducing the leakage 55 17. The method of claim 16 further including preventing 
power consumption includes reducing the supply voltage to the processor from undergoing a reset sequence when the 
a level low enough to prevent substantial leakage current. supply voltage is increased. 

8. The method of claim 7 further including reducing time 

supply voltage from a first level to a second level, such that • * * * • 
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